Am J Pharmacogenomics 2004; 4 (6): 383-393
نویسنده
چکیده
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 383 1. Information Needs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 383 2. Text Mining . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 384 2.1 Challenges and Solutions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 387 3. Case Study: Literature Profiling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 388 4. Knowledge Discovery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 390 5. Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 392 It is now obvious that the rate-limiting step in high throughput experimentation is neither data acquisition nor Abstract analysis, but rather our ability to interpret data on a genome-wide scale. Indeed, the explosion of data sampling capacity combined with increasing publication rates greatly impairs our ability to find meaning in vast collections of data. In order to support data interpretation, bioinformatic tools are needed to identify critical information contained in large bodies of literature. However, extracting knowledge embedded in free text is an arduous task, compounded in the biomedical field by an inconsistent gene nomenclature, domain-specific language and restricted access to full text articles. This paper presents a selection of currently available biomedical literature mining software. These tools rely on statistic and, more recently, semantic analyses (Natural Language Processing) to automatically extract information from the literature. In addition, a literature mining strategy has been developed to explore patterns of term occurrences in abstracts. This method automatically identifies relevant keywords in collections of abstracts, and uses a pattern discovery algorithm to generate a visual interface for exploring functional associations among genes. Term occurrence heatmaps can also be combined with gene expression profiles to provide valuable functional annotations. Furthermore, as demonstrated with tumor cell line literature profiling results, this approach can be applied to a variety of themes beyond genomic data analysis. Altogether, these examples illustrate how literature analysis can be employed to support knowledge discovery in biomedical research. 1. Information Needs foremost characterized by the transition from local to global scale studies. In order to extract meaning from the vast amounts of data The landscape of biomedical research has been transformed by generated by what has become a data intensive field, biomedical the widespread embrace of high-throughput experimental technolscientists must gain the ability to leverage knowledge accumulated ogies which have collectively given birth to the ‘omics’ fields (e.g. in the literature. genomics, proteomics, transcriptomics, metabolonomics[1]). The Knowledge is essential for data interpretation and investigators revolution, however, is more conceptual than technological, and while the ‘omics era’ is definitively high-tech, it is first and must maintain an adequate level of proficiency by keeping up with
منابع مشابه
A new case of Finnish-type congenital nephrotic syndrome, neuromuscular symptoms and early death
1. Sánchez-Conde M, Gil P, Sánchez-Somolinos M et al. Hepatic and renal safety profile of tenofovir in HIV-infected patients with hepatitis C, including patients on interferon plus ribavirin. HIV Clin Trials 2005; 6: 278–280 2. D’Ythurbide G, Goujard C, Méchaı̈ F et al. Fanconi syndrome and nephrogenic diabetes insipidus associated with didanosine therapy in HIV infection: a case report and lite...
متن کاملCommentary: reporting and assessing evidence for interaction: why, when and how?
gene-by-environment interaction studies: revelations and remedies. Epidemiology 2011;22:400–07. 4 Weinberg CR, Shi M, Umbach DM. A sibling-augmented case-only design for assessing multiplicative gene-environment interaction. Am J Epidemiol 2011;174:1183–89. 5 Kistner EO, Shi M, Weinberg CR. Using cases and parents to study multiplicative gene-by-environment interaction. Am J Epidemiol 2009;170:...
متن کاملComparative analysis of different approaches to report diagnostic accuracy.
Annu Rev Nutr. 2004;24(1):401-431. 6. Mattes RD, Donnelly D. Relative contributions of dietary sodium sources. J Am Coll Nutr. 1991;10(4):383-393. 7. Dickinson BD, Havas S; Council on Science and Public Health, American Medical Association. Reducing the population burden of cardiovascular disease by reducing sodium intake: a report of the Council on Science and Public Health. Arch Intern Med. 2...
متن کاملA System for Dissolved oxygen Control in Industrial aeration tank
3 4 5 65 37 5 383 3 9 : : 3 5 3 ; 4 4 9 55 83 6 34 4 34 : 4 4 4 5 : :3 : 4 :4 < :53 4 : : : 4 9 = 4 3 73;8: 6 4 64:7 5 9 3 4 3 355 * 3 4 3 : : : 4 : 7 393 5 3 355 5 : 3 7 3 ;<:53 :4593 9 4 4 : 9 5 :4 35 4 355 64:7 5 < 6 : 4 3 9 4 4 : :73 3 5 4; 43 7 3:5 3734 5 9 355 8: :< 35 :4 3 4 3 4 4 35 383 3 9 6 : 5 3 :4593 9 4 4 7 3 5 * 3 4 5 65 37 :5 4835 : 3 8 : 7 3 5 7 : 4 9 3 55 83 6 34 4 34 : 4 53 ; ...
متن کاملCurrent Awareness on Comparative and Functional Genomics
2004. Special issue Toxicogenomics. Mutat Res 549: (1-2) Agaton C, Uhlen M, Hober S. 2004. Genome-based proteomics. Electrophoresis 25: (9) 1280. Asenjo JA, Andrews BA. 2004. Is there a rational method to purify proteins? From expert systems to proteomics. J Mol Recognit 17: (3) 236. Austin CP. 2004. The impact of the completed human genome sequence on the development of novel therapeutics for ...
متن کامل